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Abstract 

We study random instances of the weighted d-CNF satisfiability problem (WEIGHTED d- 
SAT), a generic W[l]-complete problem. A random instance of the problem consists of a fixed 
parameter k and a random cZ-CNF formula generated as follows: for each subset of d 
variables and with probability p, a clause over the d variables is selected uniformly at random 
from among the 2 d — 1 clauses that contain at least one negated literals. 

We show that random instances of WEIGHTED d-SXT can be solved in 0(k 2 n + n°^)- 
time with high probability, indicating that typical instances of WEIGHTED d-SAT under this 
instance distribution are fixed-parameter tractable. The result also hold for random instances 
from the model ^Fu'Sid') where clauses containing less than d'(l < d' < d) negated literals are 
forbidden, and for random instances of the renormalized (miniaturized) version of WEIGHTED 
d-S AT in certain range of the random model's parameter p(n) . This, together with our previous 
results on the threshold behavior and the resolution complexity of unsatisfiable instances of 
tfe'd' provides an almost complete characterization of the typical-case behavior of random 
instances of WEIGHTED d-S AT. 

1 Introduction 

The theory of parameterized complexity and fixed-parameter algorithms is becoming an active re- 
search area in recent years [[SEE]]. Parameterized complexity provides a new perspective on hard 
algorithmic problems, while fixed-parameter algorithms have found applications in a variety of ar- 
eas such as artificial intelligence, computational biology, cognitive modeling, graph theory, and 
various optimization problems. 

The study of the typical-case behavior of random instances of NP-complete problems and coNP- 
complete problems such as satisfiability (SAT) and graph coloring has had much impact on our 
understanding of the nature of hard problems as well as the strength and weakness of algorithms 
and well-founded heuristics [fl~l[3l|5]|7]]. Designing polynomial-time algorithms that solve random 
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instances of NP-complete problems under various random distributions has also been an active 
research area. 

In this work, we extend this line of research to intractable parameterized problems. We study 
random instances of the weighted d-CNF satisfiability problem (WEIGHTED d-SAT), a generic 
W[l]-complete parameterized problem. An instance of WEIGHTED d-SAT consists of a d-CNF 
formula T and a fixed parameter k > 0. The question is to decide if there is a satisfying assignment 
with Hamming distance k to the all-zero assignment. A variant of WEIGHTED d-SAT is MINI- 
WEIGHTED d-S AT that asks if there is a satisfying assignment with Hamming distance k log n to 
the all-zero assignment. 

We show that there is an 0(k 2 n + n°W)-time algorithm that solves random instances of 
WEIGHTED d-S AT with high probability for any p(n) = C J°^" . The result also hold for random 
instances from the more general model where clauses containing less than d'(l < d' < d) 

negated literals are forbidden, and for random instances of MINI -WEIGHTED d-S AT with the ran- 
dom model's parameter p{n) being in a certain range. This, together with our previous results on the 
threshold behavior and resolution complexity of unsatisfiable instances of J^'j in iPTO . provides a 
nearly complete characterization of the typical-case behavior of random instances of WEIGHTED 
d-SAT. To the best knowledge of the author, this is the first work in the literature on the fixed- 
parameter tractability of random instances of a W[l]-complete problem. 

The main result of this paper is that instances from the random distribution (and its general- 
ization ^'J(d')) of WEIGHTED d-SAT are "typically" fixed-parameter tractable for any p = 
with c > 0. 

Theorem 1 There is an 0(k 2 n + n°^)-time algorithm that with high probability, either finds a 
satisfying assignment of weight k or reports that no such assignment exists for a random instance 
(?kS> k ) of WEIGHTED d-SAT for any p = with c> 0. 

In the appendices, we show that the same algorithm can be extended to solve random instances 
from the more general model J^?'^{dl) and random instances of MINI-WEIGHTED d-SAT for 
certain range of the probability parameter p(n). 

The next section contains necessary preliminaries and a detailed description of the random 
model. In Section 3, we present the algorithm W-SAT together with a discussion on its time com- 
plexity. In Section 4, we prove that W-SAT succeeds with high probability for random instances of 
WEIGHTED d-SAT. In the last section, we discuss directions for future work. 

2 Preliminaries and Random Models for WEIGHTED d-SAT 

An instance of a parameterized decision problem is a pair (I, k) where / is a problem instance and k 
is the problem parameter OCEl. Usually, the parameter k either specifies the "size" of the solution 
or is related to some structural property of the underlying problem, such as the treewidth of a graph. 
A parameterized problem is fixed-parameter tractable (FPT) if any instance (I, k) of the problem 
can be solved in f(k)\I\°^ time, where f(k) is a computable function that depends only on k. 
Parameterized problems are inter-related by parameterized reductions, resulting in a classification 
of parameterized problems into a hierarchy of complexity classes FPT C W[l] C W[2] ■ ■ ■ C XP. 
It is believed that the inclusions are strict and the notion of completeness can be naturally defined 
via parameterized reductions. 
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2.1 Weighted CNF Satisfiability and its Random Model 



As with the theory of NP-completeness, the satisfiability problem plays an important role in the 
theory of parameterized complexity. A CNF formula (over a set of Boolean variables) is a conjunc- 
tion of disjunctions of literals. A d-clause is a disjunction of d-literals. A d-CNF formula is a CNF 
formula that consists of d-clauses only. An assignment to a set of n Boolean variables is a vector in 
{TRUE, FALSE}™. The weight of an assignment is the number of variables that are set to TRUE by 
the assignment. It is convenient to identify TRUE with 1 and FALSE with 0. Thus, an assignment 
can also be regarded as a vector in {0, l} n and the weight of an assignment is just its Hamming 
distance to the all-zero assignment. 

A representative TV [1] -complete problem is the following weighted d-CNF satisfiability (WEIGHTED 
d-SAT) problem: 

Problem 1 WEIGHTED d-SAT 

Instance: A CNF formula consisting of d-clauses, and a positive integer k. 

Question: Is there a satisfying assignment of weight k? 

In lfl4l . Marx studied the parameterized complexity of the more general parameterized Boolean 
constraint satisfaction problem. One of the results of Marx ([ 14 ], Lemma 4.1), when applied to CNF 
formulas, is that any instance of WEIGHTED d-SAT can be reduced to at most d k instances each of 
which is a conjunction of clauses that contain at least one negated literal. Marx further proved that 
WEIGHTED d-SAT is W[l]-complete even when restricted to CNF formulas that consist of clauses 
of the form x\/ y. 

We use G(n,p) to denote the Erdos-Renyi random graph where n is the number of vertices 
and p is the edge probability In G(n,p), each of the possible (™J edges appears independently 
with probability p. A random hyper-graph Q(n,p, d) is a hypergraph where each of the pos- 
sible hyperedges appears independently with probability p. Throughout the paper, by "with high 
probability" we mean that the probability of the event under consideration is 1 — o(l). 

We will be working with the following random model of WEIGHTED d-SAT, which is basically 
similar in spirit to random CNF formulae with a planted solution studied in traditional (constraint) 
satisfiability (See, e.g., J2H9JQI3EE21Q2JQ2] and the references therein). 

Definition 2.1 Let X = {x\, • • • , x n } be a set of Boolean variables andp = p{n) be a function of 
n. Let k and d be two positive constants. 

We define a random model J^d f or WEIGHTED d-SAT parameterized by k as follows: To 
generate an instance T from TZ^, we first construct a random hypergraph Q(n,p, d) using X as 
the vertex set. For each hyperedge {x{ l , • • • , Xi d }, we include in T a d-clause selected uniformly at 
random from the set of2 d — \ non-monotone d-clauses defined over the variables {x^ , • • • , Xi d }. 
(A monotone clause is a clause that contains positive literals only). 

The model can be generalized to J~Tf(d') as follows: instead of from the set of non- 
monotone clauses, we select uniformly at random from the set of clauses over {x^ , • • • , Xi d } that 
contain at least d' negated literals. Note that •T-T'J is just ^Fu'diX)- m tne rest of this paper, we will 
be focusing on J^'$ , but will discuss how the algorithm and the results can be adapted to J^'^(d') 
in Appendix A. 
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Note that since monotone clauses are excluded, the all-zero assignment always satisfies a ran- 
dom instance of •T"?'? in the traditional sense. On the other hand, in view of Marx's results we 
mentioned earlier in this subsection, forbidding monotone clauses is not really a restriction. As a 
matter of fact, our study begins with a random model that doesn't pose any restriction on the type of 
clauses that can appear in a formula. Such a model, however, turns out to be trivially unsatisfiable 
since unless the model parameter p(n) is extremely small, a random instance will contain more than 
2k independent monotone clauses. 

2.2 Residual Graphs of CNF Formulas and Induced Formulas 

Associated with a CNF formula is its residual graph over the set of variables involved in the 
formula. There is an edge between two variables if they both occur in some common clause. The 
residual graph of a random instance of J-^f is the random graph G(n,p). The residual graph of a 
random instance of is the primal graph of the random hypergraph Q(n,p, d). 

Let T be a d-CNF formula and V C X be a subset of variables. The induced formula Ty of 
T over V is defined to be the CNF formula Ty that consists of the following two types of clauses: 

1. the clauses in T that only involve the variables in V; 

2. the clauses of size at least 2 obtained by removing any literal whose corresponding variables 
are in X \ V. 

3 A Fixed-Parameter Algorithm for Instances of T^ v d 

In this section, we describe the details of the fixed-parameter algorithm designed for random in- 
stances of J 7 ^^ and show that its time complexity is 0(k 2 n + n°( l >). The results in this section and 
in the next section together establish Theorem [T] 

3.1 General Idea 

We describe the general idea in terms of WEIGHTED 2-SAT. A detailed description of the algorithm 
for is given in the next subsection. The generalization of the algorithm to the more general 
random model J^'^(d') is presented in Appendix A. 

The algorithm W-SAT considers all the variables x that appears in more than k + 1 clauses of 
the form x V y. Any such variable cannot be assigned to TRUE. By assigning these forced variables 
to FALSE, we get a reduced formula. W-SAT then checks to see if the reduced formula can be 
decomposed into connected components of size at most log n. If no such decomposition is possible, 
W-SAT gives up. Otherwise, let {Ti, 1 < i < m} be the collection of connected components in the 
reduced formula. For each connected component Ti, use brute-force to find the set of integers L% 
such that for each k' G Li, there is an assignment of weight k' to the variables in T% that satisfies 

Finally, a dynamic programming algorithm is applied to find in time 0(k 2 n) a collection of at 
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most k positive integers {ki j , 1 < j < k} such that 

f ki 3 e , and 

|^ ki 1 + k{ 2 + • • • + ki k = k 

Combining the weight- fc^ solutions to the subproblems indexed by ij, a weight- A; solution can be 
found. If on the other hand, no such {ki j , 1 < j < k} can be found, we can safely report that the 
original instance has no weight-A; satisfying assignment. 

3.2 Details of the Algorithm W-SAT 

We first introduce the following concept that is essential to the algorithm: 

Definition 3.1 Let (J 7 , k) be an instance of WEIGHTED d-SAT where T is a d-CNF formula and 
k is the parameter. Consider a variable x and a collection of subsets of variables y = {Yi, 1 < i < 
k} where Yi = {yij, 1 < j < (d — 1)} is a subset of X \ {x}. We say that the collection y freezes 
x if the following two conditions are satisfied: 

1. Yi n Yj = 0, Vi, j. 

2. for each 1 < i < k, the clause x V yn V ■ ■ ■ V y^d-i) i s m the formula T. 

A variable x is said to be k-frozen with respect to a subset of variables V if it is frozen by a collection 
of subsets of variables {Yi, 1 < i < k} such that Yi C V, VI < i < k. A variable that is k-frozen 
with respect to the set of all variables is simply called a k-frozen variable. 

It is obvious that a /c-frozen variable cannot be assigned to TRUE without forcing more than k other 
variables to be TRUE. We also need the following concept to describe the algorithm: 

Definition 3.2 Let T be a CNF formula. We use Ljr to denote the set of integers between and k 
such that for each kl G Lp, there is a satisfying assignment of weight k! for T. 

The algorithm W-SAT is described in Algorithm 1. We explain in the following the purpose 
of the subroutine REDUCE(). The subroutine REDUCER, U) simplifies the formula T after the 
variables in U have been set to 0. It works in the same way as the unit-propagation based inference 
in the well-known DPLL procedure for traditional satisfiability search: It removes any clause that 
is satisfied by the assignment to the variables in U; deletes all the occurrences of a literal that has 
become FALSE due to the assignment; and assigns a proper value to the variables that are forced 
due to the literal-deletion. The procedure terminates when there is no more forced variable. It is 
easy to see the following lemma holds for the subroutine REDUCE(): 

Lemma 3.1 REDUCE( ) never assigns TRUE to a variable. If T' = REDUCE^, U) is empty, then 
T has a weight-k satisfying assignment if and only if at least k variables have not been assigned by 
REDUCED. 
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Algorithm 1 W-SAT 



Input: An instance (J 7 , k) of WEIGHTED d-SAT 

Output: A satisfying assignment of weight k, or UNSAT, or FAILURE 

1: Find the set of fc-frozen variables U and assign them to FALSE. 

2: Let T' = REDUCE^, U) be the reduced formula. 

3: Find the connected components {T\, • • • , Tm\ of T' . 

4: If there is a connected component of size larger than log n, return "FAILURE". 

5: Otherwise, for each connected component T\, use brute force to find Lj^.. 

6: Find a set of at most k indices {ij, 1 < j < k} and a set integers , 1 < j < k} such that 

k 

kij G Ljr. and ki 3 = k. Return "UNSAT" if there is no such index set. 

*' 3=1 

7: For each , use brute-force to find a weight-fc^ assignment to the variables in that satisfies 

8: Combine the assignments found in the above to form a weight-fc satisfying assignment to the 
formula T. 



3.3 Correctness and Time Complexity of W-SAT 

The correctness follows directly from the previous discussion. For the time complexity, we have the 
following 

Proposition 3.1 The running time of W-SAT is in 0{k 2 n + n°^). 

Proof. Since Lines 1 through 4, Line 5, and Line 7 together take time, we only need to show 

that Line 6 can be done in 0{k 2 n) time using dynamic programming. Consider an integer k and 

a collection {Li, 1 < i < m} where each Lj is a subset of integers in {0, 1, • • • , k}. We say that 

an integer a is achievable by {Li, 1 < i < m} if there is a set of indices I a = {ij, 1 < j < 1} 

i 

such that for each ij, there is a h Lj £ L; Lj so that Yl = k- We call any such an index set I a a 

i=i 

representative set of a. The purpose of Line 6 is to check to see if the integer k is achievable, and 
if YES, return a representative set of k. The Proposition follows from the follow lemma. ■ 

Lemma 3.2 Given a collection {Li, I < i < m} and an integer k where each Li is a subset of 
integers in {0, 1, ■ ■ ■ , k}, there is a dynamic programming algorithm that finds a representative set 
ofkifk is achievable, or reports that k is not achievable. It runs in time 0(k 2 m). 

Proof. Let A(t) = {(a, I a ) : < a < k} be the set of pairs (a, I a ) where < a < k is an integer 
achievable by {Li, 1 < i < t} and I a is a representative set of a. 

Let A(0) = 0. We see that A(t + 1) consists of the pairs of the form ((a + b),I a ) satisfying 

(o,J ) G A(t), 

b e L t+ i such that b < k — a, and 
I a = I a U{t}. 

A typical application of dynamic programming builds A(0),A(1), ■ ■ ■ , and A(m). The value k is 
achievable by {Li, 1 < i < m} if and only if there is a pair (k, Ik) in A(m). Since the size of A(t) 
is at most k, the above algorithm runs in 0(k 2 m) time. ■ 
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4 Algorithm W-SAT Succeeds With High Probability 



In this section, we prove that the algorithm W-SAT succeeds with high probability on random in- 
stances of J-'kd- Due to Proposition |3.1 1 we only need to show that W-SAT reports "FAILURE" 
with probability asymptotic to zero. Recall that W-SAT fails only when the reduced formula T' ob- 
tained in Line 2 has a connected component of size at least log n. The rest of this section is devoted 
to the proof of the following Proposition: 

Proposition 4.1 Let T = Fl'j be the input random CNF formula to W-SAT. With high probability, 
the residual graph of the induced formula Ty on V decomposes into a collection of connected 
components of size at most log n, where V is the set of variables that are not k-frozen. 

Proof. Let X = {xi, • • • , x n } be the set of Boolean variables, and let U be the set of A>frozen 
variables so that V = X \ U. Since p = ^d-" with c > 0, there will be many fc-frozen variables 
so that the size of U is large. If U were a randomly-selected subset of variables, the proposition is 
easy to prove. The difficulty in our case is that U is not randomly-selected, and consequently Ty 
cannot be assumed to be distributed in the same manner as the input formula T . 

To get around this difficulty, we instead directly upper bound the probability P* that the residual 
graph of T?'% contains as its subgraph a tree T over a given set Vp of log n variables such that every 
variable x G Vp is not fc-frozen. Since the variables in Ty are not fc-frozen, an upper bound on P* 
is also an upper bound on the probability that the residual graph of Ty contains as its subgraph a 
tree of the size log n. We then use this upper bound together with Markov's inequality to show that 
the probability that the residual graph of Ty has a connected component of size at least log n tends 
to zero. 

Let T be a fixed tree over a subset Vp of log n variables. The difficulty in estimating P* is that 
the event that the residual graph of TT'j contains T as its subgraph and the event that no variable 
in T is /c-frozen are not independent of each other. To decouple the dependency, we consider the 
following two events: 

1. A: the event that the residual graph of T?'% contains the tree T as its subgraph; and 

2. B: the event that none of the variables in Vp is ^-frozen with respect to X \ Vp. 

Since by definition, being /c-frozen with respect to a subset of variables implies being A;-frozen with 
respect to all variables, we have 

P* <F{AnB} . (4.1) 

We now claim that 

Lemma 4.1 The two events A and B are independent, i.e., 

¥{A\B] = ¥{B} (4.2) 

Proof. Note that the event A depends only on those d-clauses that contain at least two variables 
in Vp and that the event B depends only on those d-clauses that contain exactly one variable in Vp. 
Due to the definition of the random model J = 2 ,1 j, the appearance of a clause defined over a d-tuple 
of variables is independent from the appearance of the other clauses. The lemma follows. ■ 



Based on Equation (4.1 1 and Lemma 4.1 we only need to estimate P {^4} and P {B}. The 



following lemma bounds the probability that a variable is not /c-frozen. 
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Lemma 4.2 Let x be a variable and W C X such that x G W and \ W\ > n — log n. We have 

1 log^ n 

P {x is not k-frozen with respect to W\ < 0(1) max(— =•, ) 

n° n 

where < 5 < ^-g^ . 



Proof. Let be the number of clauses of the form x V yi V ■ ■ ■ V y^-i with {yi, • • • , yd-l} C 
X \ Vt- Due to the definition of J^f, the random variable N x follows the binomial distribution 
Bin(p, m) where p = and m = 

Write a — (2 d — l^d— i)! ' Chernoff bound (see Appendix B), we have 



{N x <k} < 2e 3pm < 0{k)e 



■ f log n 



E 0(rT 5 ) ( where < 5 < ^). (4.3) 

Let D be the event that in the random formula T, there are two clauses 

x Vyn V • • • V 2/i(d_i), and 
xVyx 2 V Vy 2 (d-i) 

such that {yn, • • • , yi(d-i)} n {yi2, • • • , y2(d-l)} 7^ 0- The total number of such pairs of clauses 
is at most 

, n — log n\ (n — log n x 
(d-l)f 



d-1 J V d - 2 
The probability for a specific pair to be in the random formula is 

1 clogn x " 
2 d -l n d - x 



By Markov's inequality, we have 



•{P}G0( l0gV 



n 

Since the probability that the variable x is not /c-frozen is at most 

P{{N X < k}UV} , 

the lemma follows. 



From Lemma |4T2] we have 
Lemma 4.3 For sufficiently large n, 

F{B} < O(l) (n- s \ (4.4) 
for some Q<6 < min( 3(2d _ 1 c )(d _ 1) , , 1). 
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Proof. Let E x be the event that a variable x 6 Vt is not /c-frozen with respect to X \ Vt- Since 
|Vt| = log re, the bound obtained in Lemma 4.2 applies to W = X \ Vt- Since for any x G Vjs the 



event E x only depends on the existence of clauses of the form 

x Vyu V ••• Vy i(d _i) 

with {yji, • • • iViU-t)} C X \ Vt, we see that the collection of the events {E x ,x € Vt} are 
mutually independent. The Lemma follows from Lemma |4~2| ■ 

Next, we have the following bound on the probability P {A}. 



Lemma 4.4 



{A} < O(l)(logn) losn n- los? 



Proof. Recall that A is the event that a random instance of induces all the edges of a fixed 
tree T with vertex set Vt of size log re. We follow the approach developed in (6l [lOl [131 and extend 
the counting argument from 3-clauses to the general case of d-clauses with d > 2. 

Let Ft be a set of clauses such that every edge of T is induced by some clause in Ft- We say 
that Ft is minimal if deleting any clause from it leaves at least one edge of T uncovered. 

Consider the different ways in which we can cover the edges of T by clauses. Treat the clauses 
in Ft as being grouped into d — 1 different groups {Si,l < i < (d— 1)}.A clause in the group Si 
is in charge of covering exactly i edges of T. Note that a clause in the group Si may "accidently" 
cover other edges that are not its responsibility. As long as each clause has its own dedicated set of 
edges to cover, there won't be any risk of under-counting. 

Let Si = |5j|, 1 < i < d— 1. We see that < Si < log n/i. Since each clause in Si is dedicated 
to i edges and there are in total log n — 1 edges, we have 



d-l 

E 



IS; 



log n — 1 . 



(4.5) 



i=l 



Counting very crudely, there are at most 



/log n\ 



ways to pick the dedicated sets of i edges for the 



Si clauses in group S{. Since T is a tree, for each set of i edges there are at most (2 d — 1) 

ways to select the corresponding clauses. Therefore, by Markov's inequality, we have that P {A} 
can be upper bounded by 



E 

0<Si<log n 



(logra)* (2 d -l)» 



n 



•£(d-i-i) Si / dogn 1 



<o(i) Yl ^ 

0<Si<logn 



n) losn n 



2 d -ln d 

E(-* s ») 



and due to Equation ( |4.5[ ), we have 

F{A} < 0(l)(logn) d (logn) logn 



n 



< 0(l)(logn 



i 21ogTl n _logn 



;n+l 
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This proves Lemma [44] 



Continuing the proof of Proposition 4. 1 we combine Lemma 4.3 and Lemma 4.4 to get 



'{AnB}< O(l)(logn) 21osn n- logn 



n 



log n 



Since the total number of trees of size logra is at most n logn (logn) logri 2 , the probability that the 
residual graph of Ty contains a tree of size log n is 



n 



log n 



(logn) logn - 2 P{^nfi} 



< Oil 



)(logn) 31ogn (V* 5 ) 



log n 



(4.6) 



Proposition 4. 1 follows. 



Proof. [Proof of Theorem [TJ To use Proposition 4.1 to prove that the algorithm W-SAT succeeds 
with high probability, we note that the reduced formula T' in Line 2 of the algorithm W-SAT is 
sparser than the induced formula Ty. In fact, it is easy to see that T' is an induced sub-formula of 
Ty over the set of variables that have not been assigned by the subroutine REDUCE(). Therefore by 
Proposition 4.1 with high probability T 1 decomposes into a collection of connected components, 
each of size at most log n. It follows that W-SAT succeeds with high probability. 

Combining all the above, we conclude that the algorithm W-SAT is a fixed-paramter algorithm 
and succeeds with high probability on random instances of 2^'%. This proves TheoremjTJ ■ 



5 Discussions 



The results presented in this paper, together with our previous results on the threshold behavior 
and the resolution complexity of unsatisfiable instances of J^f in IPTT1 . provides a first probabilis- 
tic analysis of W[l] -complete problems. For WEIGHTED 2-SAT and MINI-WEIGHTED 2-SAT, 
the behavior of random instances from the studied instance distribution is fully characterized. For 
WEIGHTED d-SAT with d > 2, the characterization is almost complete except for a small range 
of the probability parameter where the parametric resolution complexity is missing. In summary, 
random instances of WEIGHTED d-SAT from the random model under consideration are "typi- 
cally" fixed-parameter tractable, and hard instances (in the sense of fixed-parameter tractability) are 
expected only for MINI-WEIGHTED d-SAT. 

While we believe the random model ^'j(d') is very natural, we feel that it is challenging to 
come up with any alternative and natural instance distributions for weighted d-CNF satisfiability 
that are interesting and hard in terms of the complexity of typical instances. 

On the other hand, there are still many interesting questions with the model J-^'j. First, the 
behavior of random instances of MINI-WEIGHTED d-SAT with d > 2 is interesting due to the 
relation between such parameterized problems and the exponential time hypothesis of the satisfi- 
ability problem. Second, for p = cl °^" with c small enough, there will be sufficient number of 
"isolated" variables and by simply setting k of these variables to TRUE and the rest of the variables 
to FALSE, we obtain a weight-/c satisfying assignment. It is interesting to see what will happen if 
these isolated variables have been removed. 
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6 Appendix A - Generalization to the Model F%'%{d') 

Consider the model J^^id!) , d! < d, that generalizes the model J^'d- To generate a random 
instance J 7 of Fj!j(d'), we first construct a random hypergraph G(n,p, d) in the same way as with 
the random model J^tj. For each hyperedge {x^ , • • • , Xi d }, we include in J 7 a d-clause selected 
uniformly at random from the set of the <i-clauses over {x^ , • • • , Xi d } that contain at least d! negated 
literals. 

Note that with the above definition, the original model J 7 ^'^ is just ^T'jfl). Similar to the 
analysis for J 7 ^'^ presented in fTTTl . the following threshold behavior of the solution probability can 
be established 

Lemma 6.1 Consider a random instance (^'^(d 1 ) , k) of WEIGHTED d-SAT. Let p = with 
c > being a constant and let c* = a,d{d — d') \ with being the number of d-clauses over a fixed 
set of d variables that contain at least d' negated literals. We have 

limP |^ d{d f ) is satisfiable j = 

For p = ^gz^r , the algorithm W-SAT can be adapted to solve a random instance of J^'^d!) in 
0(k 2 n + n°W)n( d '- 1 ) time by using the following generalization of a fc-frozen variable: 

Definition 6.1 Let (J 7 , k) be an instance of WEIGHTED d-SAT where T is a d-CNF formula and k 
is the parameter. Let 2 < d' < d be a fixed integer. 

Consider a variable x, a set of (<f — 1) variable S = {x\, • • • , x^'-i}, and a collection of 
subsets of variables y = {Yi, 1 < % < k} where 

Yi = { yij ,l<j<(d-d')} 

is a subset of X \ ({x} U S). We say that the collection y of subsets of variables freeze x on S if 

1. YiHYj = 4>,Vi,j. 

2. for each 1 < i < k, the clause 

x\ V • • • V Xd'-i V x V ya V • • • V y^d-d') 

is in the formula T. 

Lemma 6.2 Ifx is k-frozen on S = {x\, • • • , Xd'-i}, then assigning all the variables in S to TRUE 
forces x to be FALSE. 

The modification of W-SAT to solve random instances of Fu'd(d') is as follows: For each of 
the ( d ,™ J possible sets of (d' — 1) variables S = (x\, • • • , Xd'-i), set them to TRUE and all the 
variables that are fc-frozen on 5* to FALSE; Apply the subroutine REDUCEQ to obtain a reduced 
formula J 7 '; Use the same technique in W-SAT to check to see if J 7 ' has a satisfying assignment of 
weight k — (d! — 1). The overall running time is 0(k 2 n + n°^)n^ d 



f 1, ifc<c*, 
\ 0, ifc > c* 
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7 Appendix B - Random Instances of MINI- WEIGHTED d-SAT 



In the proof in Section 4 and in this section, we use the following Chernoff bound 

Lemma 7.1 Let I be a binomial random variable with expectation \x. We have 

_£ 

F{\I-n\ >t} <2e 3m. 

As a variant of WEIGHTED d-SAT, the problem MINI- WEIGHTED d-SAT with parameter k 
asks if for a given d-CNF formula, there is a satisfying assignment of weight k log n. For random 
d-CNF formula ffig, the algorithm W-SAT for MINI-WEIGHTED d-SAT needs to be adapted 
to make use of the existence of k log n-frozen variables. To guarantee that W-SAT still succeeds 



with high probability, a result similar to Proposition |4.1| is needed. This amounts to showing that 
the probability for a variable a 
k2 d ~ 1 (d - 1)!, this is the case. 



the probability for a variable x to be k log n-frozen is small enough. For p = cl °^f with c > 



Theorem 2 There is an 0(k 2 n + n°^)-time algorithm that solves with high probability a random 
instance k) of MINI-WEIGHTED d-SAT for any p = with c > k{2 d -l)(d- 1)1 



Proof. The proof is almost the same as the proof of Proposition 4. 1 except that we need to establish 
an upper bound on the probability that a variable is not A; log n-frozen. For c > k(2 d — l)(d — 1)1, 



Lemma 7.1 on the tail probability of a binomial random variable is still effective and the arguments 



made in the second half of the proof of Lemma 4.2 and in the proof of Lemma [43] are still valid. 



The only difference is the accuracy of the upper bound. In this case, we have P {B} < 0(1) ^jt^ 

(k-c) 2 c 
^3(2 d -l)(d-l)! ' 



(k—c) 2 c 

where < 6 < min( „, yA ,_ 1V , 1), and this is sufficient for the result to hold. 
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